High-Dimensional Sparse Econometric Models, an Introduction
نویسنده
چکیده
In this chapter we discuss conceptually high dimensional sparse econometric models as well as estimation of these models using l1-penalization and postl1-penalization methods. Focusing on linear and nonparametric regression frameworks, we discuss various econometric examples, present basic theoretical results, and illustrate the concepts and methods with Monte Carlo simulations and an empirical application. In the application, we examine and confirm the empirical validity of the Solow-Swan model for international economic growth. 1 The High Dimensional Sparse Econometric Model We consider linear, high dimensional sparse (HDS) regression models in econometrics. The HDS regression model has a large number of regressors p, possibly much larger than the sample size n, but only a relatively small number s < n of these regressors are important for capturing accurately the main features of the regression function. The latter assumption makes it possible to estimate these models effectively by searching for approximately the right set of the regressors, using l1-based penalization methods. In this chapter we will review the basic theoretical properties of these procedures, established in the works of [8, 10, 18, 17, 7, 15, 13, 27, 26], among others (see [20, 7] for a detailed literature review). In this section, we review the modeling foundations as well as motivating examples for these procedures, with emphasis on applications in econometrics. Let us first consider an exact or parametric HDS regression model, namely, Alexandre Belloni Duke University, Fuqua School of Business, 100 Fuqua Drive, Durham, NC, e-mail: [email protected] Victor Chernozhukov Massachusetts Institute of Technology, Department of Economics, 50 Memorial Drive, Cambridge, MA e-mail: [email protected]
منابع مشابه
Inference for high-dimensional sparse econometric models
This article is about estimation and inference methods for high dimensional sparse (HDS) regression models in econometrics. High dimensional sparse models arise in situations where many regressors (or series terms) are available and the regression function is wellapproximated by a parsimonious, yet unknown set of regressors. The latter condition makes it possible to estimate the entire regressi...
متن کاملPower Enhancement in High Dimensional Cross-Sectional Tests.
We propose a novel technique to boost the power of testing a high-dimensional vector H : θ = 0 against sparse alternatives where the null hypothesis is violated only by a couple of components. Existing tests based on quadratic forms such as the Wald statistic often suffer from low powers due to the accumulation of errors in estimating high-dimensional parameters. More powerful tests for sparse ...
متن کاملTPS 2013 (1).indd
Modeling of complex systems is commonly confronted with high dimensional set of independent variables. Similarly, econometric models are usually built using time series data that often exhibit nonstationarity due to the impact of some policies and other economic forces. In both cases, linear regression modeling may yield unstable least squares estimates of the regression coeffi cients. Principa...
متن کاملRobust Estimation in Linear Regression with Molticollinearity and Sparse Models
One of the factors affecting the statistical analysis of the data is the presence of outliers. The methods which are not affected by the outliers are called robust methods. Robust regression methods are robust estimation methods of regression model parameters in the presence of outliers. Besides outliers, the linear dependency of regressor variables, which is called multicollinearity...
متن کاملMammalian Eye Gene Expression Using Support Vector Regression to Evaluate a Strategy for Detecting Human Eye Disease
Background and purpose: Machine learning is a class of modern and strong tools that can solve many important problems that nowadays humans may be faced with. Support vector regression (SVR) is a way to build a regression model which is an incredible member of the machine learning family. SVR has been proven to be an effective tool in real-value function estimation. As a supervised-learning appr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011